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Abstract 

Multifractal processes are a relatively new tool of stock market analysis. Their 
power lies in the ability to take multiple orders of autocorrelations into account 
explicitly. In the first part of the paper we discuss the framework of the Lux model 
and refine the underlying phenomenological picture. We also give a procedure of 
fitting all parameters to empirical data. We present a new approach to account for 
the effective length of power-law memory in volatility. The second part of the paper 
deals with the consequences of asymmetry in returns. We incorporate two related 
stylized facts, skewness and leverage autocorrelations into the model. Then from 
Monte Carlo measurements we show, that this asymmetry significantly increases 
the mean squared error of volatility forecasts. Based on a filtering method we give 
evidence on similar behavior in empirical data. 
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1 Introduction 



The past decades of quantitative analysis of stock market data have produced 
several fruitful concepts and predictive methods. After the groundbreaking 
works of Bachelier [1] and with the early discovery of the broad distribution 
of asset returns [2], the application of additional insights from physics have 
become possible [3,4,5]. Similarly to the case of fully developed turbulence 
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[6], multifractal models [7,8,9,10] catch important aspects of price patterns, 
that the traditional ARCH-GARCH [11,12,13] framework cannot [7]. Scaling 
methods [14,15,16] can be used to quantify the autocorrelation properties of 
these patterns, and the results can be taken into account explicitly. Based on a 
discrete time stochastic volatility framework, one can construct powerful and 
flexible random processes, which have great applicability to forecasting. 

Present paper deals with the applications of the model introduced in [8]. In 
the following we will refer to this as the Lux model, because we follow exactly 
the framework developed by Thomas Lux. We must note, that based on the 
paper by Mandelbrot, Calvet and Fisher [7], a similar model was proposed by 
Calvet and Fisher [17]. Section 2 facilitates notations and describes the main 
important stylized facts of stock prices. Section 3 introduces discrete time 
stochastic volatility models and the Lux model of stock returns. We give some 
additional insights into the underlying phenomenology, rooted in empirical 
findings [18]. Section 4 gives one method of parameter approximation and 
discusses limitations of the Lux model. Section 5 deals with the extension of 
the model to account for skewness and leverage autocorrelations [19,20,21,22]. 
Based on a corresponding filtering of the data and Monte Carlo measurements 
we evaluate the effect of asymmetry on volatility forecasting. 

The calculations were compared to an analysis of real financial data. We ana- 
lyzed the Deutsche Aktienindex (DAX) in the period Jan. 1999 to Dec. 2001. 
The data includes transactions before opening and after closing. The sam- 
pling interval is 15 seconds. The series consists of 1829545 data points. We 
also analyzed data of the 200 most liquid (traded in most minutes over the 
interval) stocks at New York Stock Exchange (NYSE200). The data is based 
on the TAQ database [23], and it includes the period Jan. 2000 - Sep. 2002. 
The sampling interval is 1 minute, with a total of 266966 data points. 



2 Empirical properties of stock returns: stylized facts 

Over the last century of stock market research and especially in the last few 
decades, a number of universal features regarding stock market price dynamics 
have become apparent [24,25,26,27]. These so-called stylized facts are (at least 
qualitatively) independent of the market, the asset traded 1 and even the time 

1 There are some notable differences between stocks, commodities and foreign ex- 
change data, however these are beyond the scope of our paper. In the following we 
will limit ourselves to the discussion of stock price properties, although most obser- 
vations hold for various markets. One remarkable difference is the lack of long-term 
up-down asymmetry in foreign exchange price variations, a consequence of a greater 
symmetry than usual stock versus currency valuation [27]. 
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period, although their quantitative value might vary due to inevitable insta- 
tionarities. 



Let Y(t) denote the price of a financial asset at time t. We define returns as 

r At (t)=Y(t + At)-Y(t). (1) 

The logarithm of the price will be denoted as p(t) = log(F (£)). The logarithmic 
returns over a time horizon At are then defined as 

x At (t) = p(t + At) - P (t) ee log ( pTff^ J ■ ( 2 ) 

The first major group of stylized facts of our concern is related to the prob- 
ability distribution of log-returns x At (see also Figure 4). Classical stylized 
facts for short (minute-scale) At are that the center of the distribution (den- 
sity function) behaves like a Levy distribution (f(r) oc r -1 ), and that the tails 
are truncated according to f(r) oc r~ a with a G [3,6] [18,28,29,30]. This is 
the so-called "inverse cubic law". The tail exponent for DAX was found to 
be a ~ 3.79 (see also Figure 4). This decay is much slower, than expected 
from a Brownian approximation: many-a events (large price changes) are not 
uncommon. These heavy tails can be characterized by the kurtosis: 



(x-(x)y- 

KA = / 2 \2 ' ■ ( 3 ) 

Gaussian distribution has a kurtosis of 3, a greater value shows an excess 
weight of large events. At values of a < 5 kurtosis does not exist, and the 
measured values originate from a finite sample effect. The estimate of K4 di- 
verges as the length of the measurement period goes to infinity [18]. 

The distinct scaling behavior of the tails is very useful for applications, because 
that regime corresponds to large events [31], the main sources of financial risk. 
It is notable, that the power-law regime of DAX is observed to be 20<r-s wide, 
that is much greater than usual tails of the Gaussian distribution. We have 
to note on the immense predictive power of this observation. Over the whole 
three years of observation the number of 6<t+ events was approximately 200. 
For a shorter time series given, the data is even more scarce, and we cannot 
give a statistically correct approximation of the large event distribution, unless 
we accept this kind of extrapolation. 

Classical studies also imply, that the distribution has a negative skewness, 
defined as: 

(x- {x)f) 

The presence of negative skewness is called gain-loss asymmetry [27], and it 
is attributed to psychological factors. More recent analyses show [32] that the 



«s = 7-^372 < 0- ( 4 ) 
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skewness is not stationary, but it characterizes the state of the market: it is 
negative when a downward tendency of prices is present, but positive on rally 
days. 

All the above apply to short time scales, nevertheless the distribution is not 
stable. By increasing the At horizon, slowly, with a characteristic time horizon 
of At ~ 10 — 100 days, it converges to a Gaussian. This is called aggregational 
Gaussianity [27,30,33,34]. Consequently, also: k 3 — > 0, — > 3. This conver- 
gence is not unexpected from the central limit theorem, as (x 2 At (t)) exists [27]. 

A second major group of stylized facts is related to the autocorrelation prop- 
erties of the time series. These also display universal features (also with some 
differences between stocks, commodities, and foreign exchange rates [27,35]). 
These properties also appear to be very robust for different time periods in- 
vestigated. The returns are said to have a short memory. This means, that 
their linear autocorrelation decays exponentially: 

CV(r) = (r At (t + r)r At (t)) oc exp (-^j , (5) 

where r is of the order of 10 minutes [28,29]. This, however, by far does not 
mean, that they are independent. Non-linear functions of the returns do exhibit 
long-memory. Regarding our later arguments, we found useful to measure the 
autocorrelations of squared logarithmic returns. This can be given as: 

CAr) = (xUt + r)x 2 At (t)). (6) 

Measurements on NYSE200 show, that for At < 1 day, C x i (r) oc r _/3 , with 
(5 « 0.307. Apparently C x i{t — > oo) = cr 4 , where o 2 = {x At (t) 2 ). Note that 
the previously mentioned aggregational Gaussianity is a stronger statement, 
than just the disappearance of correlations. Nevertheless, this can be one good 
indicator of such a crossover. Other signs could be the breakdown of multi- 
fractal scaling or the disappearance of excess kurtosis. These have a similar 
time scale, but there is no theoretical necessity for all to occur at the very 
same At. In fact there are strong indications, that the time of the crossover 
point depends on the criterion we choose [27,34]. 

There is a strong connection with two more stylized facts: volatility clustering 
and volatility mean reversion. By volatility we will refer to the instantaneous 
standard deviation of log-returns. There is assumed to be a "normal", mean 
level of volatility, that characterizes x over a long time interval. If a difference 
from this average behavior emerges, the alternative size log-returns tend to be 
clustered together, corresponding to greater volatility for a time period. Then 
eventually this excess volatility decays to the mean level. This is the process 
which can be quantified by C x 2(r). 

One can define generalized q-th order autocorrelation functions as in (7), and it 
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is widely recognized, that the related scaling exponents r(q) are a non-linear 2 
function of q. This is called the multifractal nature of logarithmic (and also 
linear) returns [10,27,35,36]. The power-law scaling law holds for very long 
(months to years) time lags [37]. 

c,(At) = (|x At (t)r)ocA^ +1 (7) 

The last stylized fact we would like to mention is the leverage effect 3 , that 
is closely related to gain-loss asymmetry. As reported by several works, for 
positive time lags the volatility-return correlation function is asymmetric and 
decays with time [19,21,22,27,38]: 

Cl(t) = Mtm+TV m _ B(T)K , (T (8) 



Q(At) is the Heaviside function. The decay is usually considered exponential. 
K*(t) = Aexp (— where [38] reports A rts 1.9 and r 69 days for Amer- 
ican stocks. On the other hand these results are usually very noisy, and in 
the pertinent literature we find no indisputable argument for the exponential- 
ly. There are various, qualitatively similar formulas originating from different 
models [22]. In the case of DAX we find that data are just as well fitted by a 
power-law, K*(t) = Bt- a * . 



3 Modeling logarithmic returns 



3.1 Stochastic volatility models 



Logarithmic returns are considered in this framework to originate from the 
product of two time series: 

x A t(t) = A At (t)aS At (t). (9) 

a is the standard deviation of logarithmic returns, in our notations A(t) and 
S(t) are normalized to unit mean square. S(t) is a sign factor, usually assumed 
to be uncorrelated. A(t) is an amplitude factor, usually it is defined to be 
positive, and it is long-time correlated. This latter process models basically 
the volatility. It is the source of volatility clustering, mean reversion and time 

2 With this definition there is a linear relation (also referred to as monofractality) 
for Brownian motion: TBrownian(q) = q/2 — 1. This can be taken as a good reference 
for measurements. 

3 Note that this seriously differs from what traders refer to as leverage. 
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correlations apparent in returns (see Section 2). As S(t) practically determines 
the sign of x only, volatility can be approximated as 

aA At (t) = \x At (t)\. (10) 

The sign process S(t) is mostly modeled by Gaussian white noise, or simply a 
random +1/— 1 value. 

Recent studies have unveiled a strong relationship between trading activity 
and the volatility process. Plerou et al. [18] imply, that on any time scale At, 
where the 9tAt(£) number of trades of the asset in the periods [t, t+At] is much 
greater, than 1 (for liquid stocks they find this limit applies from as short as 
At = 15 minutes), the log- returns of the asset can be well approximated as 

XAt(t) = y/WAt(t)W At (t)N(0, 1). (11) 

9?Ai(0 i s found to be strongly correlated, and to have power- law tails. The 
term W At (t) is the realized volatility, which is only weakly correlated, and also 
has power-law tails. 

3.2 Multifractal modeling 

One can define a broad class of models of logarithmic returns through (10). 
The aim is to find an appropriate A(t) stochastic process, that accounts for 
as many stylized facts as possible. We consider here models which focus on 
the multifractal property (7), because this is a general approach to take mul- 
tiple orders of autocorrelations into account. For practical applications like 
volatility forecasting these play a key role. In the literature, there are three 
relevant multifractal models currently available. The first proposition to use 
a multifractal process as stochastic volatility was the Multifractal Model of 
Asset Return (MMAR) by Mandelbrot, Calvet and Fisher given in a series of 
papers [7,36,39]. A different process, the Multifractal Random Walk (MRW) 
was developed in [9,40,41,42]. Here we discuss in detail a modified version of 
the MMAR model which we call Lux model [8]. 

3.3 The Lux model 

In both MMAR and the Lux model the logarithmic price is assumed to follow 

p(t)=p{0) + B{8(t)), (12) 

where B(t) is an Integral Brownian motion with standard deviation a and 8(t) 
is an increasing function of t, and it plays the role of time-deformation, which 
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trading time [At] trading time [At] 



Figure 1. A visual explanation of the common idea of MMAR and the Lux model. 
The actual values are only to be taken as illustration. Left: p(t) logarithmic price of 
the asset and the 9'{t) increments of trading time. One can see, that the absolute 
value of log-returns is usually larger in periods, where 9'(t) is larger. Right: The 
corresponding 9(t) flow of virtual trading time as a function of real time. 

is well-established in econometric modeling [2,7,43,44]. 

Let 9'(t) denote the increments 9'(t) = 9{t) —0{t — At). The function 9(t) can 
be thought of as a virtual trading time, that increases with the real time t. In 
different periods the passing of 9(t) can be slower or faster (a visual explana- 
tion for the process can be seen in Figure 1). When measured in this virtual 
time, the logarithmic price is assumed to follow a simple Integral Brownian 
Motion, and thereby slower or faster trading (smaller and larger values of 9'(t) 
respectively) correspond to smaller and greater volatility from the real time 
point of view. The finite lengths of these regimes result in volatility clustering. 

To gain deeper insight into the meaning of time-deformation we have to refor- 
mulate (12). We know, that in discrete time B(t) is the sum of independent 
Gaussian increments aS(t) = N(0,a). Discretization of time can be inter- 
preted as having a finite sampling interval of financial data (minutes or days). 
Using the stability of the Gaussian distribution one can give a somewhat gen- 
eralized formalization: 

B(6(t))=aN(0, yj9(t)/At). (13) 
With the above notations we get 4 : 

B{9(t))= £ V^^=iV(0,l). (14) 

r=At,2At...,t VAf 

4 One can see, that we can choose At = 1, without any loss of generality. It is very 
intuitive, that on a different At time scale the process remains the same, only the 
typical size of price changes varies. This is in harmony with the notion of scale- 
invar iance. 
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Equivalent ly for returns: 

x At (t) = ^(t)-^N (0,1), (15) 

where the first term corresponds to the amplitude A& t (t) introduced in (9). 

One can deduce a formal analogy between (15) and (11). The value of the 
6' (i) increments of virtual trading time can hence be related to measurable 
quantities: 

^TKt = V^AtWW^). (16) 

This shows, that 6'(t) does not represent purely a time deformation, such as 
virtual trading time would equal the number of trades. It also contains the 
realized volatility process W^t- 

We can think of 6'(t) both as the square volatility on an equidistant time scale, 
and as a quantity proportional to number of trades in the same period. We 
discussed in Section 2, that volatility has a multifractal property, but analo- 
gously ^Atit) also appears to lack a typical time scale for highly capitalized 
and frequently traded stocks [45] . 

To complete our definition we have to give the random process we wish to use 
for modeling 9(t). It is more convenient to give the increments instead: 

k 

6>(t)=2 k Hm?\ (17) 
i=i 

k can be any integer number, while all rnf 1 are taken from a log-normal distri- 
bution. The trick that facilitates multifractal behavior is that are defined 
to have strong time autocorrelation: 



(l) Jexp(7V(-Aln2,2(A-l)ln2))) 
mUi = \ (l) , (18) 

where the choice is made as follows. We go from i = 1 . . . k, and choose the top 
option with probability 2~ h+t , or if for any previous i we had already chosen 
the top option. We choose the bottom option otherwise. The moments of the 
distribution: 

^ (rnf } ) q/2 ^ = exp(-(g/2)A In 2) exp((g/2) 2 (A - 1) In 2) = 2-^ {q)+l) . (19) 

It is easy to show, that the autocorrelation of any is exponential with 
characteristic time steps 2 fc ~\ On the other hand as one takes the k — > oo 
limit, the autocorrelation of 6'{t) becomes decaying as a power-law. Also for 
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any finite k < oo the process has the multifractal property (7) if At < 2 k , 
with 



the same scaling exponents as for the moments. 
3.4 Issues of causality 

Now we turn to the question of applicability, as it is very important to note on 
a major weakness of multifractal models. The MM AR is said to be not causal, 
because the whole return time series is generated at the same time, as a bino- 
mial cascade [7] . Hence it cannot be used for volatility forecasting. There is no 
way to treat empirical data as the "past" values of such a cascade and generate 
a mean future behavior. The Lux model (and MRW) has no such theoretical 
limitation, these processes can be generated "causally". This means, that if we 
know the present state of the model, we can generate the distribution of future 
conditional volatility step by step. The problem here is how to reconstruct the 
initial conditions of the model from empirical data. For GARCH-type models 
[12] the present state of the market is fully characterized by the past return 
time series and this data is readily available. The application of the Lux model 
however would demand the value of the initial mfs^'"^ multipliers, not only 
their products, which are proportional to returns. Multipliers are not directly 
observable and there is no maximum likelihood estimator available at present. 
One partial success has been achieved by Calvet and Fisher [46] recently, but 
that corresponds to a simplified case. A future advantage of the Lux model 
can be that the number of these state variables is small (k), while for MRW 
this number is untreatably large (and infinite in the ideal case). 



4 The Lux model revisited 

The aim of this chapter is to give an outline of the estimation of the parameters 
of the Lux model. We propose a new method to take the effective length of 
the power-law memory of log-returns into account explicitly. Then we note on 
some incosistencies with real data. 

4-1 Parameter approximation 

Let us make clear what parameters are necessary to (statistically) reproduce a 
given time series. Firstly, a is the standard deviation of log-returns. Secondly, 
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Figure 2. Left: r(q) spectrum of DAX as measured from MF-DFA(l) with the an- 
alytical curve for the fitted Lux model. Top right: Scaling of DAX for moments 
q = —3, —2, — 1, 0.1, 0.3, 0.5 ... 2.5. Bottom right: Scaling of DAX for moments 
q = 2.7, 2.9,... 4.9, 5, 5.5, 6, 6.5, 7. 

parameter A decides the distribution of the mjp multipliers, and through this 
the multiscaling behavior. And finally k is the number of multipliers. In [8] two 
methods are proposed to determine A, but the well-known scaling estimator 
method is found to perform poorly compared with GMM estimation (see also 
[47]). The performance of the scaling approach can be improved substantially 
by using the Multifractal Detrended Fluctuation Analysis (MF-DFA) [15,16] 
procedure instead of simple scaling. A low (first or second) order method 
already eliminates most of the instationarity in the empirical data and the 
resulting spectra have much less bias. Although simulated sample paths of 
the Lux model are stationary, even in this case MF-DFA brings a significant 
improvement. Results for the DAX dataset are shown in Figure 2, Xdax = 
1.13206 ± 0.00156. Typical values are A = 1.01 . . . 1.30. 

It is also important to note, that q > 3 moments of x do not exist in a strict 
sense, as a consequence of the fat tails. The empirically measured exponents 
originate from a finite-size effect, which is one form of apparent multifractality 
[48]. However the inclusion of these moments does not change the fitted A 
values significantly. 

The performance of a GMM estimator proposed by Lux [8,47] is still better 
than MF-DFA by means of bias and root mean squared error. However the dif- 
ference is much smaller, than for the traditional scaling estimator. We would 
like to note, that the typical bias and error in A that arises from MF-DFA 
does not change the actual behavior of the fitted moments significantly, and 
therefore it seems plausible, that a slight misspecification of A does not seri- 
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Figure 3. Left: Empirical (x 2 {t)x 2 {t + r)) values for DAX and the fitted Lux model. 
Right: Empirical (x 2 (t)x 2 (t + r)) values averaged over 200 stocks in NYSE200, and 
possible fits from the Lux model. 

ously decrease forecasting performance. Finally, the scaling method might be 
much more intuitive, and should not be discarded. 

The k number of multipliers is also called cascade depth. As k basically de- 
termines the length of the power-law regime in the memory of the generated 
process, it is straightforward to measure k from autocorrelations. Our choice 
is to use the second order autocorrelations of logarithmic volatility as defined 
in (6). Using combinatorial assumptions Lux [8] gives the exact formula 



(Atya + x» = «* (g (n (i - ^) r ) (i - (i - ^) r ) 

{( m ?r)" (m«) 2 '- 2 " + (l - (l - ^j) {m?f) (,21) 

where the moments can be calculated from (19). This expression can be inter- 
preted to have two qualitatively different regimes. For r < 2 k the correlations 
are power-law, and for r > 2 k they disappear. We observed similar typical be- 
havior in various stocks, and based on this crossover we managed to fit the k 
cascade depth. The procedure for DAX is shown in Figure 3. We also give the 
average of {x 2 (t)x 2 (t + r)) over the 200 stocks of NYSE200. In the empirical 
data one can clearly see peaks corresponding to the daily periodicity, which 
can be filtered out by normalization of the intraday volatility distribution [28]. 
This however does not affect our observation, that the fit from the Lux model 
is very sensitive to the value of k. If we first choose A according to the MF- 
DFA method proposed herein or the GMM estimator of [8], then the above 
procedure determines the value of k. The appropriate choice of k ensures the 
correct length of power-law memory in the simulated time series. 
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Figure 4. Left: Distribution of the logarithmic returns for DAX in the period 
1999-2001, and the corresponding fitted Lux model. Top right: Tail behavior of 
DAX logarithmic returns. The power-law fit gives a tail exponent a 3.79. Bottom 
right: Tail behavior of the Lux model logarithmic returns. The power-law fit gives 
a tail exponent a 4.77. 

Note, that the At size of the elementary time step determines the resolution of 
modeling. The crossover from power-law memory to uncorrelated log-returns is 
qualitatively at 2 k time steps. In a physical interpretation different resolution 
models of the same process (stock) should have the same length of memory, 
thus 2 k At = const. 



4-2 Tail behavior 



If we make a log-return distribution histogram (Figure 4) we can see that the 
TLF-like regime is fitted excellently by the model. Assuming tails are power- 
law, the tail exponent is larger than expected. For DAX a ~ 3.79, while for the 
fitted Lux model it is a ~ 4.77. This results in missing finite-sample kurtosis, 
k dax ^ gQ3^ K Lux ^ 34 _2. This might limit the applicability of the model 
for long-time forecasting, where these many-sigma events come into play. The 
problem is that the value of measured kurtosis depends on the k number 
of multipliers used. We fit k from {x 2 (t)x 2 (t + r))-type autocorrelations, the 
r = value of which is closely related to the kurtosis. To match the r > 
behavior of typical minute-scale data k ~ 10 has to be chosen, while the correct 
r = behavior is given by k ~ 30. There seems to be no solution to this 
problem within the framework of the Lux model, because all the parameters 
of the model are strictly set. It is in principle possible to introduce a new 
parameter or to change the distribution of the multipliers from log-normal to 
give superior results. 



12 



Although visual (see Figure 4) and statistical methods cannot easily distin- 
guish simulation results from empirical power-laws in the tail of the distribu- 
tion, there is good reason to think that the simulated tail should decay faster 
since the random variable results from the product of variables taken from a 
Gaussian and a log-normal distribution. This is the reason why, in constrast 
to the empirical distribution ("inverse cubic law"), the moments (\xAt\ 9 ) exist 
for any q, see (19). 



5 Third moment related effects 

In this section we first introduce two extensions of the Lux model, for up-down 
asymmetry and leverage autocorrelations. Then we evaluate the implications 
of leverage on volatility forecasting. 

The original Lux model does not reproduce the up-down asymmetry: 



(x At (t) 3 ) oc (A(t) 3 S(t) 3 ) = (A(tf) (S(t) 3 ) = (A(t) 3 ) * = 0. (22) 



The two main assumptions for these calculations are that A(t) and S(t) are 
independent of each other, and that S(t) has a vanishing third moment. Re- 
laxing one (or both) of these conditions can be adequate to account for these 
two additional stylized facts. 

5. 1 The extended Lux model 

Returning to the basic idea of stochastic volatility models (9), we can modify 
the sign term in the Lux model. Previously we assumed S(t) = N(0, 1). Since 
the A(t) amplitude factor is non-negative, skewness can only originate from 
the sign. Therefore we introduce a new, discrete random variable: 



Similarly, leverage autocorrelations vanish: 



C L (r) oc (A(t)S(t)A(t + r) 2 S(t + r) 2 ) = 
(A(t)S(t)A(t + r) 2 S(t + rf) (S(t)) = 0. 



(23) 




(24) 



One can show, that the formula 
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(x*(t)) = ((A(t)aS(t)) 3 ) « (A(t) 3 )a 3 (S(t) 3 ) = 

-4ea 3 * exp (^-k(X - 1) In 2^ (25) 

works well for a range of parameters and can be used to fit e to empirical 
data. This gives a mean level of skewness in the simulations. The approach 
shows one typical effect that is inherent in the Lux model (and possibly other 
similar stochastic volatility models). Although we can reproduce the mean 
level of skewness (or leverage autocorrelations as we will see), the model is 
stationary, and consequently the predicted fluctuations are much smaller than 
for empirical data. 

This new source of skewness does not change the time correlation properties of 
the process. Pochart and Bouchaud proposed a new term to introduce leverage 
autocorrelations to their MRW model [20] . Their approach seems to be readily 
applicable to any stochastic volatility model specified as (9). We therefore 
extend the Lux model as follows: 



x(t) = 



\ 



k 

2 k f[ m| l) exp I - £ S(t')K(t - t') ) aS(t). (26) 



i=i \ t'<t 



K(t) can be considered a kernel funcion. It can be basically any kind of de- 
caying function, but later on we will prove, that qualitatively Cl{t) ~ K{r) 
in an appropriate limit (also see [20]). 



One can show analytically, that for 



K 



K(t-t') = Q(r -(t-t')) JT -^ (27) 

in a first order approximation the relation 

[x 2 At (t)) » a 2 At (28) 



remains unchanged. Note that r in (27) is a finite time cutoff in the memory. 
We used it instead of an exponential factor Pochart and Bouchaud [20] use 
to regularize integrals in calculations for the a < 0.5 case, because it makes 
simulations run faster, while (taken large enough) not affecting the results 
significantly. 

In the a > 0.5 case for the third moment we get 

xlt(t)) » -^^exp(Hn2(l-A)/2)At 2 - a (29) 
' 2 — a 
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Combining (27) and (29) gives for skewness: 

Ks(At) = / ftu3/2 w -^-exp(Hn2(l - A)/2)At - 8 -. (30) 



This shows that the addition of this new term also results in the appearance 
of skewness, which decays with time for a > 0.5, and increases for a < 0.5. 
Pochart and Bouchaud [20] also report, that their process loses skewness in 
the continuous-time limit, when the size of the elementary time steps At — > 0. 
This is corrected by our independent skewness parameter e. 

We do not wish to repeat all the calculations of [20] for the case of the Lux 
model, only to indicate, that the explicit form of the stochastic volatility term 
is indifferent to the behavior of leverage autocorrelations. Instead, we focus on 
the applications of this approach. As defined in Section 2, leverage autocorre- 
lations are 

{x{t)x\t + r)) 

C l{t) = 2 • (31) 

(x 2 {t)) 

In empirical data Cl(t) is typically said to decay exponentially [19,38,21]. 
Pochart and Bouchaud argue [20], that the introduction of an exponential 
characteristic time for leverage would destroy the scale invariance of the pro- 
cess, and therefore they treat the power-law leverage autocorrelations as an 
approximation. However, later in this section we find that the power-law also 
well describes data. 

Now in order to see the behavior of leverage autocorrelations, we introduce a 
different K(t — t') kernel function to (26) by analogy to [38]: 

(32) 




We make a first-order approximation for the correlation term 

exp^xXj^Oexp^-^j wl-^E^Oexp^-^. (33) 

This holds, if the argument of the first exponential is small in absolute value. 
Since the series {S(t')} is a Brownian motion, we know that qualitatively 



E s(f) 



{t-t'fl\ (34) 



We can substitute the remaining exponential with 9(t')9(T — t'), to get an 
approximate condition for the weakly correlated limit: 



tfoE 5 (Oexp(--) E 5(0 cx X lv ^ < 1. 

t'<t \ T °J 0<T<T() 



(35) 
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Similar formulas can be given for power-law kernel functions defined in (27). 



Now we return to the notation K{t') to deduce some general results. Given 
the above linearization is justified, leverage autocorrelations can be written as 

(a 2 S(t) 2 ) 2 C l (t) = (x(t)x(t + r) 2 ) = (A(t)A(t + r) V 
S(t + rfaS(t)(l + j:S(t')K(t')] fl+ E S(t")K(t")) ). (36) 

V t'<t J \ t"<t+T J I 

By the independence of some terms, the only contribution from the parentheses 
that remains (with 1, 1, and t" = t terms selected from the sums respectively) 
is 

(a 2 S(t) 2 ) 2 C l (t) = {A(t)A(t + r) 2 S(t + At) 2 ) (2a 3 S(t) 2 K(r)) , (37) 
and finally 

C l (t) = (A(t)A(t + rf) 2a- 1 K(r). (38) 

This is a general result, that holds for any kernel function K(t'), as long as the 
linearization works. To calculate the amplitude autocorrelations, we generalize 
(21) for arbitrary order autocorrelations. Straightforward calculation yields: 

{wt+ ^=g(n(i-^) T ) (i-(i-^) T ) 

(( m f) ( ' ,+W2 )"((KT /2 >(K , ) W2 ))""+ 
( 1 -0-^) T )((KT /2 )<K , ) W2 ))' < 39 > 

Skewness originating from the sign changes leverage correlations in a way, that 
can be calculated from (25): 

C L (r) = (A(t)A(t + r) 2 ) 2a- 1 K{r){l - 2eK{r)). (40) 

The last term is a third order correction and has no significance in prac- 
tice (eK(r = 1) ~ 1CT 2 typically). For the exponential case this expression 
performs similar to those presented in the recent comparison by Perello and 
Masoliver [22]. 

We could find no indisputable evidence for a well-defined, mean leverage in 
NYSE200, only for individual stocks. We found DAX leverage autocorrelations 
to be well fitted by a power-law type kernel, with a ~ 0.384, K « 0.03. Results 
can be seen in Figure 5. From the comparison of the smoothed graphs one can 
see, that the mean behavior of the model and the original time series is very 
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DAX (smoothed) 

Lux model (smoothed) 



-0.2-1 — r 

500 1000 1500 2000 -0.4 j 

t [15 sec] o 500 1000 1500 2000 

x [15 sec] 

Figure 5. Left: Measured leverage autocorrelations of DAX and a sample path with 
fitted parameters. Noise was removed by a 50-neighbor averaging. Top right: Lever- 
age autocorrelations of DAX without smoothing. Bottom right: Leverage autocor- 
relations of simulated sample path fitted to DAX, without smoothing. 

similar. However if we do not remove the noise by smoothing, it can be seen, 
that the empirical fluctuations around the mean value are much larger than the 
Lux model would predict. Part of this effect can originate from that the data 
was unfiltered. The predictable intra-daily trends [28] of volatility that are 
not reproduced by the model can account for increased fluctuations. Further 
investigation is possible, here we would only like to note on the fact, that there 
is no a priori reason why the Lux model should give the right size of fluctua- 
tions. Possibly a higher inherent level of noise is present in more complicated, 
agent-based simulations. In our opinion empirically observed fluctuations are 
larger, because of the inevitable instationarity of the price process. This effect 
seems to be similar to that reported in [32]. Third- moment related quantities 
are instationary, and this might limit their applicability to forecasting. 



5.2 The role of leverage in volatility forecasting 

At present maximum likelihood type estimators for our volatility forecasts are 
unavailable. As noted in Section 3.4, a partial success has been achieved in 
[46], but the results are not readily applicable to complicated cases like the 
Lux model or MRW. Lux [8] proposes the use of a Levinson-Durbin algorithm 
(algorithm 6 of [49]) instead of maximum likelihood. This can be proven to be 
the best linear predictor, and it takes power-law autocorrelations of logarith- 
mic returns into consideration. A simple formalization, consistently with our 
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Figure 6. Comparison of the mean sqared error of volatility forecasts by historical 
volatility and Levinson-Durbin (m = 2000), when the underlying time series is 
generated by a Lux model including and not including leverage. The approximation 
is given with 30000 sample runs, model parameters are k = 12, A = 1.1, Kq = 0.03, 
a = 0.384. The results shown are from a 5-neighbor smoothing of a h = 1, 2, . . . , 50 
series. 

notations would be: 

m— 1 

i 2 (<H)=E^- i ), (41) 

i=0 

where x 2 = x 2 — a 2 , a is the standard deviation of log-returns and cfrf^ are 
constants determined by the set x 2 (t — m + 1 . . . t). For explicit formulas see 
[49]. 

This method ignores the sign process. Now we will show, that the sign infor- 
mation and in particular the leverage effect are indeed relevant to forecast- 
ing performance. We carried out measurements to compare the (theoretical) 
mean squared errors of different methods in the presence of leverage auto- 
correlations. We did this by running 30000 Monte Carlo iterations with sam- 
ple paths simulated by the Lux model. The parameters were fitted to DAX 
(A = 1.13205, k = 15), but we also used the respective values of leverage 
(Kq = 0.03, a = 0.384). The results (Figure 6) show that the effect of asym- 
metry is not negligible from the point of view of volatility forecasting, because 
the accuracy of the forecasts is decreased. Actually the loss of accuracy caused 
by leverage is roughly the same as the gain from historical volatility to the Lux 
model. This shows, that leverage autocorrelations do not only produce asym- 
metry in volatility smiles, but are also relevant for the predictions of future 
conditional volatility. Note that this loss also applies to historical volatility, so 
it does not affect the relative performance of the two methods. 



18 




[15 sec] x [15 sec] 



Figure 7. Left: Leverage of natural logarithm of DAX. Right: Leverage of natural 
logarithm of DAX after the application of the leverage filter. K(t — t') = ( - f _°;^ 384 . 
The insets show the time series. 



Let us now consider the empirical time series to originate from any stochastic 
volatility model (9), where S(t) is a discrete, symmetric (e = 0) sign process. 
It is straightforward to construct a simple filter to remove the leverage effect 
from the logarithmic returns: 

x(t) = x(t) exp ^£ WX* ~ j > ( 42 ) 

where x(t) is the original set of log-returns, K(t — t') is the leverage kernel 
function fitted to the data, and S(t') = sgn (x(t')). Figure 7 shows the leverage 
of the natural logarithm of DAX and its filtered counterpart, with the insets 
showing the actual time series. After the application of the filter, the resulting 
data are up-down symmetric, except for the drift of prices. This property 
improves the performance of both historical volatility and Lux model-based 
forecasts, which originally both omit sign information. The results are given 
in Figure 8. The comparison between forecasts on the filtered and unfiltered 
data shows that there is a reasonable gain in accuracy. The average decrease 
of error 5 is comparable to the improvement from historical volatility to the 
Lux model. This underlines the theoretical expectations (Figure 6). 



5 These results are related to the performance of a procedure, that first filters 
out past leverage autocorrelations, then makes a forecast based on this data, and 
then restores leverage autocorrelations. The only difference is that to restore future 
leverage, we would have to use "future" sign information. This can be formally 
resolved by taking S(t' > t) = 0. Further research will be necessary to evaluate this 
in detail. 
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Figure 8. Left: performance of the linear forecast versus historical volatility for the 
leverage filtered DAX series. The inset shows the mean squared error values with a 
5-neighbor smoothing. Right: comparison of linear forecasts for the filtered and the 
unfiltered data. The inset shows the mean squared error values with a 5-neighbor 
smoothing. All results are normalized with the respective standard deviations. 

6 Conclusion 



In this paper we have reviewed the generalized stochastic volatility frame- 
work used to introduce multifractal models of stock prices. We focused our 
investigations on the Lux model [8] and we have shown one efficient proce- 
dure to fit the model parameters to actual data. We have given arguments, 
that the effective length of the power-law memory (one stylized fact that can 
be measured in practice) in the model is governed by the parameter k. We 
have introduced an extension of the Lux model to give up-down asymmetry, 
and also adopted the approach of Pochart and Bouchaud [20] to account for 
leverage autocorrelations. This extended model reproduces the mean behavior 
of real stock market data well, but not the large fluctuations resulting from 
instationarity. We have introduced a filter to remove this leverage from the 
data. Filtered DAX data and Monte Carlo simulations both show, that the 
presence of asymmetry increases mean squared error of forecasts that omit 
sign information. Consequently, it can be useful to extend the models in use, 
taking leverage into consideration with the hope of substantial gain in accu- 
racy. Further research should aim to produce new techniques, that use the 
available sign information efficiently to take advantage of our findings. 
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